Generalized Analysis of Molecular Variance

نویسندگان

  • Caroline M Nievergelt
  • Ondrej Libiger
  • Nicholas J Schork
چکیده

Many studies in the fields of genetic epidemiology and applied population genetics are predicated on, or require, an assessment of the genetic background diversity of the individuals chosen for study. A number of strategies have been developed for assessing genetic background diversity. These strategies typically focus on genotype data collected on the individuals in the study, based on a panel of DNA markers. However, many of these strategies are either rooted in cluster analysis techniques, and hence suffer from problems inherent to the assignment of the biological and statistical meaning to resulting clusters, or have formulations that do not permit easy and intuitive extensions. We describe a very general approach to the problem of assessing genetic background diversity that extends the analysis of molecular variance (AMOVA) strategy introduced by Excoffier and colleagues some time ago. As in the original AMOVA strategy, the proposed approach, termed generalized AMOVA (GAMOVA), requires a genetic similarity matrix constructed from the allelic profiles of individuals under study and/or allele frequency summaries of the populations from which the individuals have been sampled. The proposed strategy can be used to either estimate the fraction of genetic variation explained by grouping factors such as country of origin, race, or ethnicity, or to quantify the strength of the relationship of the observed genetic background variation to quantitative measures collected on the subjects, such as blood pressure levels or anthropometric measures. Since the formulation of our test statistic is rooted in multivariate linear models, sets of variables can be related to genetic background in multiple regression-like contexts. GAMOVA can also be used to complement graphical representations of genetic diversity such as tree diagrams (dendrograms) or heatmaps. We examine features, advantages, and power of the proposed procedure and showcase its flexibility by using it to analyze a wide variety of published data sets, including data from the Human Genome Diversity Project, classical anthropometry data collected by Howells, and the International HapMap Project.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On generalized topological molecular lattices

In this paper, we introduce the concept of the generalized topological molecular lattices as a generalization of Wang's topological molecular lattices,  topological spaces, fuzzy topological spaces, L-fuzzy topological spaces and soft topological spaces. Topological molecular lattices were defined by closed elements, but in this new structure we present the concept of the open elements and defi...

متن کامل

Molecular diversity within and between Ajowan (Carum copticum L.) populations based on inter simple sequence repeat (ISSR) markers

Study of genetic relationships is a prerequisite for plant breeding activities as well as for conservation of genetic resources. In the present study, genetic diversity among and within 15 Iranian native Ajowan(Carum copticum L.) populations were determined using inter simple sequence repeat (ISSR) markers. Twelve selected primers produced 153 discernible bands, with 93 (60.78%) being ...

متن کامل

Inferences on the Generalized Variance under Normality

Generalized variance is applied for determination of dispersion in a multivariate population and is a successful measure for concentration of multivariate data. In this article, we consider constructing confidence interval and testing the hypotheses about generalized variance in a multivariate normal distribution and give a computational approach. Simulation studies are performed to compare thi...

متن کامل

Comparison of Maximum Likelihood Estimation and Bayesian with Generalized Gibbs Sampling for Ordinal Regression Analysis of Ovarian Hyperstimulation Syndrome

Background and Objectives: Analysis of ordinal data outcomes could lead to bias estimates and large variance in sparse one. The objective of this study is to compare parameter estimates of an ordinal regression model under maximum likelihood and Bayesian framework with generalized Gibbs sampling. The models were used to analyze ovarian hyperstimulation syndrome data.   Methods: This study use...

متن کامل

Evaluation of Genetic Diversity in Iranian Violet (Viola spp) Populations Using Morphological and RAPD Molecular Markers

Recognition of genetic reserves and desirable genes is the basis of breeding programs. So far, in Iran, due to the lack of recognition of genetic resources, a considerable breeding program has not been done on native plants. The study of the genetic diversity of violets as a native plant with ornamental and medicinal uses is the great importance in advancing the breeding goals of this plant. So...

متن کامل

Genetic Diversity of Marrubium Species from Zagros Region (Iran), Using Inter Simple Sequence Repeat Molecular Marker

This study concerns the genetic diversity and taxonomic status of Marrubium species from central and south-west of Zagros region, Iran. It is investigated by Inter-Simple Sequence Repeat analysis. A total of 68 accessions from five Marrubium species were collected from their natural habitats. Molecular analysis was approved with 17 primers, of which 12 were carried out in the reaction mixture. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • PLoS Genetics

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2007